MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis

نویسندگان

چکیده

Recently, self-supervised pre-training has advanced Vision Transformers on various tasks w.r.t. different data modalities, e.g., image and 3D point cloud data. In this paper, we explore learning paradigm for mesh analysis based Transformers. Since applying Transformer architectures to new modalities is usually non-trivial, first adapt processing, i.e., Mesh Transformer. specific, divide a into several non-overlapping local patches with each containing the same number of faces use position patch’s center form positional embeddings. Inspired by MAE, how Transformer-based structure benefits downstream tasks. We randomly mask some feed corrupted Then, through reconstructing information masked patches, network capable discriminative representations Therefore, name our method MeshMAE, which can yield state-of-the-art or comparable performance tasks, classification segmentation. addition, also conduct comprehensive ablation studies show effectiveness key designs in method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Variational Autoencoders for Deforming 3D Mesh Models

3D geometric contents are becoming increasingly popular. In this paper, we study the problem of analyzing deforming 3D meshes using deep neural networks. Deforming 3D meshes are flexible to represent 3D animation sequences as well as collections of objects of the same category, allowing diverse shapes with large-scale non-linear deformations. We propose a novel framework which we call mesh vari...

متن کامل

Mesh-based Autoencoders for Localized Deformation Component Analysis

Spatially localized deformation components are very useful for shape analysis and synthesis in 3D geometry processing. Several methods have recently been developed, with an aim to extract intuitive and interpretable deformation components. However, these techniques suffer from fundamental limitations especially for meshes with noise or large-scale deformations, and may not always be able to ide...

متن کامل

LSTM Autoencoders for Dialect Analysis

Computational approaches for dialectometry employed Levenshtein distance to compute an aggregate similarity between two dialects belonging to a single language group. In this paper, we apply a sequence-to-sequence autoencoder to learn a deep representation for words that can be used for meaningful comparison across dialects. In contrast to the alignment-based methods, our method does not requir...

متن کامل

Analysis of digitized 3D mesh curvature histograms for reverse engineering

Today, it has become more frequent and reasonably easy to digitize the surface of 3D objects. However, the obtained results are often inaccurate and noisy. In this paper, we present an efficient method to analyze a curvature histogram from a digitized 3D surface using a real object. Moreover, we propose to use the curvature histogram analysis for many steps of a reverse engineering process, whi...

متن کامل

Gaussian Copula Variational Autoencoders for Mixed Data

The variational autoencoder (VAE) is a generative model with continuous latent variables where a pair of probabilistic encoder (bottom-up) and decoder (topdown) is jointly learned by stochastic gradient variational Bayes. We first elaborate Gaussian VAE, approximating the local covariance matrix of the decoder as an outer product of the principal direction at a position determined by a sample d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-20062-5_3